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Abstract 

In this work we extend the concept of uplink-downlink rate balancing to frequency division duplex 
(FDD) massive MIMO systems. We consider a base station with large number antennas serving many 
single antenna users. We first show that any unused capacity in the uplink can be traded off for higher 
throughput in the downlink in a system that uses either dirty paper (DP) coding or linear zero-forcing 
(ZF) precoding. We then also study the scaling of the system throughput with the number of antennas in 
cases of linear Beamforming (BF) Precoding, ZF Precoding, and DP coding. We show that the downlink 
throughput is proportional to the logarithm of the number of antennas. While, this logarithmic scaling is 
lower than the linear scaling of the rate in the uplink, it can still bring significant throughput gains. For 
example, we demonstrate through analysis and simulation that increasing the number of antennas from 4 
to 128 will increase the throughput by more than a factor of 5. We also show that a logarithmic scaling 
of downlink throughput as a function of the number of receive antennas can be achieved even when 
the number of transmit antennas only increases logarithmically with the number of receive antennas. 
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I. Introduction 

In recent years, as cellular networks evolve, there has been increasing emphasis on the need for 
efficient transmission schemes. Next generation networks are required to deliver more throughput, 
e.g., increase their spectral efficiency, but at the same time also increase their energy efficiency. 
Massive MIMO, where each base station (BS) has many antennas, promises to bring a significant 
increase in the rates of cellular networks (e.g., [l]-[5] and references therein). It was also shown 
to improve the energy efficiency (EE), at least in some of the scenarios [6], [7]. 

But, the realization of this capacity gain requires the availability of sufficiently accurate channel 
state information (CSI). Hence, much effort has been spent on the analysis and optimization of the 
training signals, channel estimation and CSI feedback (e.g., [8], [9]). Eurthermore, many works 
have observed that the benefits of massive MIMO are limited in frequency division duplex (EDD) 
systems (e.g., [l]-[4]) because of the difficulty to provide CSI to the base station. Thus, most 
of the analysis of massive MIMO system have focused on time division duplex (TDD) where 
the CSI can be measured by exploiting the channel reciprocity. 

In practice, however, the majority of deployed systems employ frequency division duplex 
(EDD). Thus, although the gains are smaller than in TDD, it is still very important to be able 
to characterize the benefits of massive MIMO in EDD systems. In EDD systems, CSI is mostly 
obtained through feedback from the mobile terminals. Thus, much research has been devoted to 
the optimization of the CSI feedback in order to gain the most from the available antennas. 

One approach that has been considered is to exploit knowledge of the spatial correlation in the 
CSI feedback. Large gains were demonstrated using this approach (e.g., [10]-[13]). The channel 
correlation changes quite slowly, and can be measured directly from the uplink transmissions. 
Hence, the knowledge of the channel correlation allows a significant reduction in the required 
feedback rate while achieving the same throughput. However, while the use of spatial correlation 
can increase the downlink rates, it cannot improve the asymptotic scaling of the rates with respect 
to the number of antennas. Hence, in this paper we will study the throughput scaling for the 
case of uncorrelated antennas, while keeping in mind that practical systems should also take 
advantage of the spatial correlation. 

As the CSI feedback requires uplink resources, the use of massive MIMO in EDD systems 
creates a tradeoff between the downlink throughput and uplink throughput. This tradeoff has been 


July 15, 2015 


DRAFT 


3 


investigated in several works. However, the ability to use this tradeoff to dynamieally eontrol the 
balanee between the uplink and downlink rates has been widely ignored. For example, Caire et al. 
[8] and Kobayashi et al. [9] eonsidered several types of CSI feedbaek, and the maximization of 
the downlink throughput for a given amount of overhead (channel uses for training and feedback). 
But, in practical systems the uplink load and the downlink load can differ significantly. Hence, 
the ‘price’ of uplink resources is different for different systems, and may also change with time. 
We also note that the systems in [8], [9] did not specifically consider massive MIMO, and do 
not scale well with the number of antennas. In particular, both works assume that the number of 
users is equal to or larger than the number of antennas. Such a scenario allows a linear growth 
of the throughput with the number of antennas only if the feedback rate also scales linearly with 
the number of antennas (see also [14]). But, as will be demonstrated in Section IV-A, the uplink 
cannot support such a large amount of feedback. 

In contrast to the current literature, our focus here is on the scaling of the throughput with 
the number of antennas in the downlink of FDD massive MIMO systems. In the following we 
show that the downlink sum-rate in an FDD system can scale logarithmically with the number 
of antennas. This is due to the fact that downlink throughput is linearly related to the feedback 
rate (e.g., [8], [9], [14]), but the uplink can only support an increase in feedback rate that is 
logarithmic in the number of base station antennas. 

The logarithmic scaling of downlink throughput in FDD systems is in contrast to the linear 
scaling achievable in TDD systems, where channel reciprocity can be used for downlink channel 
estimation and hence there is no need for CSI feedback. For example, it is shown in [1], [3] 
that the number of served TDD users scales linearly with the number of antennas, and that the 
per-user rate converges to a constant; thus, the total system throughput scaled linearly. Note 
that although the asymptotic user rate depends on various factors (e.g., scheduling, channel 
estimation, pilot contamination, processing type etc.), the throughput scaling remains linear as 
long as the channel coherence block (in time and frequency) is large enough. Some works have 
also addressed the system optimization with ‘relatively small’ number of antennas, and even 
showed that such optimization can significantly reduce the number of antennas that are required 
to achieve a specific throughput gain [15], [16]. However, even in these cases, the asymptotic 
scaling remains linear. 

In this work we adopt the uplink-downlink balancing principle introduced in [17], [18], [19], 
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and use it to characterize the throughput scaling of both link directions in an FDD system as the 
number of antennas grows. The uplink-downlink balancing principle suggests the use of available 
uplink capacity to increase the downlink capacity. The relevance to massive MIMO systems is 
obvious: the uplink enjoys the most gain due to the availability of CSI at the BS. Some of this 
uplink capacity increase can be traded off to also increase the downlink throughput. 

We first demonstrate that any unused uplink rate can be converted to an increase in the 
downlink rate with a favorable exchange ratio. This result can be proven trivially for systems 
that perform dirty paper coding based on the results in [17], [19]. We also provide a novel proof 
that shows that the same exchange ratio is achievable in systems that perform linear zero-forcing 
precoding. 

Subsequently, we also consider a scenario in which a constant fraction of the uplink rate is 
devoted to CSI feedback, and show that the downlink sum-rate can scale logarithmically with 
the number of antennas. Note that this result differs from the results in [17], [19], as the scaling 
here is with respect to the number of antennas, and not with respect to the signal to noise 
ratio (SNR). The logarithmic scaling of the downlink rate is lower than the linear uplink-rate 
or TDD-rate scaling, but still provides good throughput gains, which we further illustrate with 
simulation results. For example, increasing the number of antennas from 4 to 128 can increase the 
throughput by a factor of 5. Furthermore, we also show that this logarithmic scaling on downlink 
throughput can be achieved even if the number of transmit antennas grows only logarithmically 
with the number of base station receive antennas. This important result helps minimize the pilot 
reference symbols required on the downlink and lowers implementation cost and complexity. 

II. System model 

We consider a cellular system with K single antenna mobiles, and one BS with N antennas. 
Assuming a flat fading channel and focusing on the downlink, the symbols that are received by 
all the mobiles at a given time can be written in a vector form as: 

y = ypHx-Mi (1) 

where H is the K x Nt complex valued channel matrix, whose i, j-th element, Hij, is the channel 
gain from the Ath BS to the j-th mobile, x is an iVt x 1 complex vector of the transmitted 
symbols, and n is an J’f x 1 noise vector whose elements are mutually independent proper 
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complex Gaussian random variables [20] with zero mean and unit variance. We assume that all 
BSs antennas are used for reeeption, but not neeessarily for transmission, i.e., we eonsider the 
general ease where the number of transmit antennas may be less than the number of reeeive 
antennas, Nt < N. For normalization, we assume that the power of each channel element is 
£'[|iFjjP] = 1, and p denotes the SNR. The transmitted symbols must satisfy the sum power 
constraint: 

^[iixir] < 1. (2) 

We assume a bloek fading ehannel model, so that H, is assumed to be fixed during the 
transmission of a bloek of T symbols (in different times or frequeneies), but ehanges in an 
independent manner from block to block. The channel elements are further assumed to be 
mutually independent, proper, eomplex Gaussian random variables with zero mean. 

In this paper we foeus on the ease of perfeet ehannel estimation at the mobile receivers, and 
imperfeet (i.e., quantized) CSI at the BS transmitter. Thus, we assume that the receivers ean 
measure the ehannels without errors. Note that the effeets of ehannel estimation, pilot signals 
and pilot contamination can be very signifieant in massive MIMO systems (e.g., [21]-[24]), 
partieularly for the case of short eoherenee time. Yet, as most eellular FDD systems have 
relatively large eoherenee times, this paper foeuses on the ehannel feedbaek, and leaves the 
affeets of ehannel estimation for future study*. 

III. Uplink-Downlink rate tradeoff 

Our main results characterize the sealing of an FDD massive-MIMO system with the number 
of antennas. We show that the number of served users in the downlink can scale logarithmically 
with the number of antennas. This sealing is obtained by using part of the eapaeity gain of the 
uplink (due to the use of massive-MIMO) to send a large amount of feedback in the downlink, 
using the uplink-downlink balancing concept. These results give motivation for the utilization 
of massive-MIMO in FDD systems, and not only in TDD systems. 

*It is also important to note that the presented rate scaling result is achievable when the number of transmit antennas scales 
logarithmically with the number of receive antennas. Thus, the required number of pilot symbol will typically be quite low 
compared to the channel coherence time, and the challenges of channel estimation are not very large. 
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We start by showing that any un-needed rate in the uplink of eaeh user ean be traded into 
a downlink rate, with a good exehange ratio. We define Fj as the total feedbaek rate sent by 
mobile i (over the uplink) and Ri as the downlink rate aehievable by mobile i. 

Theorem 1 (Massive-MIMO uplink downlink balancing): In a system that uses either dirty 
paper eoding (DP) or linear zero-foreing preeoding (ZF), the ratio between the downlink rate 
and the eorresponding feedbaek rate at the interferenee limited regime is lower bounded by: 


. 1 Ri T 

hm hm — > — 

Fi^ao p^oo Fi Nt 


( 3 ) 


Proof: The proof for a system that applies DP is a straight forward adaptation of Theorem 
1 in [17], [19] to the ease of Massive MIMO. The proof for a system that applies ZF is given 
in Subseetion IV-A. ■ 

The exehange ratio of Theorem 1 (T /Nf) is favorable as long as the size of a ehannel eoherenee 
bloek (in symbols) is larger than the number of transmit antennas. This is good news beeause 
in many eellular seenarios the eoherenee bloek ineludes hundreds of symbols or more (see [19] 
for an example). 

Interestingly, the theorem holds for eaeh mobile separately, i.e., the uplink-downlink ratio ean 
be balaneed for eaeh mobile separately aeeording to its needs and eapabilities. Nevertheless, in 
the rest of the paper we foeus on the ease where all users have identieal serviee requirements 
and eapabilities, and study the sealing of the number of supported users with the number of 
antennas. 

Theorem 1 deseribes the uplink downlink tradeoff for a given number of transmit antennas. 
We next turn to study the sealing as the number of antennas grows to infinity. As the uplink 
user rate seales only logarithmieally with the number of antennas, the available feedbaek is not 
suffieient to allow linear sealing of the number of users with the number of antennas^. In the 
following we show that using either a DP seheme, a linear beamforming seheme or a linear ZF 
seheme ean allow a logarithmie sealing of the number of users with the number of antennas, 
while keeping a eonstant rate per user. This sealing is summarized by the following theorem: 

Theorem 2 (Linear precoding): At the massive MIMO asymptote, devoting a fraetion, Cf, of 
the uplink rate of eaeh mobile to feedbaek, for any Cu > 0, a system that employs linear preeoding 


^For example [8] (using linear ZF) and [17] (using DP) required a feedback rate per user that grows linearly with the number 
of transmit antennas. 
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or DP can support K = log 2 N mobiles while providing a eonstant rate for eaeh mobile that 
is lower bounded by: 

P C{T 


lim Ri > log 2 

N^OO 


1 + 


( 4 ) 


1 + p Cu log 2 e 

The same number of users ean also be supported with only Nt = Ct log 2 N transmit antennas, 
for any Ct > 0 in the ease of linear beam-forming preeoding (BF) and for any q > in the 
ease of ZF or DP. In sueh ease, the rate of eaeh mobile ean be bounded by: 

/ f Ct f I _ \ 


lim Ri > log 2 

N—>oo 


1+p- 


t-^){ 


_CfT 


V 


A 


( 5 ) 


/ 


1 + p(l -/3) + p/32 “ 
where /3 = 0 for BF and /3 = 1 for ZF or DP, while A = 0 for BF and ZF, and 

(1 + l0g2 (l - 


A = 


7 


log2(e) > 0 


( 6 ) 


for DP, with 


7 = p 


CfT 

1 - 2 ^ 
liA ■ 

1 + p2 


( 7 ) 


Proof: See Seetion IV-B. ■ 

The bounds of Theorem 2 are appealing beeause they show a signifieant rate inerease in the 
downlink of an FDD system through the use of massive MIMO. Furthermore, the ability to 
achieve this sealing while the number of transmit antennas seale only logarithmieally with the 
number of reeeive antennas (even while only using beam-forming) signifieantly reduees the 
implementation eomplexity. Note that the large number of reeeive antennas is still needed, in 
order to inerease the uplink eapaeity and allow the required sealing of the feedbaek rate. 

Note that no seheme is uniformly best. As A is monotonie inereasing with we see that A 
is lower bounded by 0 and henee, DP is (asymptotieally) always better than ZF^. The relation 
between the DP and the BF sehemes is more diffieult to eharaeterize. However, one ean easily 
verify that the BF seheme is asymptotieally preferable over the ZF seheme if and only if: 


£.>(1_2-7) " 

Ct 1 -f p 


( 8 ) 


^Recalling that for DP and ZF Cu ^ Ct, and also noting that 7 < 1, we also have that A <^log,(l + ^)-toggle) < 
2 — logjle) = 0.56, and hence the difference between the two schemes is bounded. 
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Since the DP seheme outperforms the ZF seheme, Condition (8) is not suffieient to show that 
BF is preferable to DP. Yet, an inspeetion of (5) shows (similarly to (8)) that BF is preferable 
over DP if the SNR, p, or the feedbaek fraetion Cf are small enough, or if the number of users 
is large enough (reeall that the BF seheme ean also work when the number of users is larger 
than the number of transmit antennas, i.e., Cu > Ct). 


IV. Proof of Theorems 1 and 2 


The proof assumes high rate feedbaek from the mobiles to the BS and preeoding of the 
transmission to eaeh mobile using this feedbaek. The strueture of the suggested system is 
deseribed in the following Subseetion, leading to the proof of Theorem 1. The proof of Theorem 
2 is given in Subseetion IV-B. 

A. Proof of Theorem 1 

1) Uplink rate: As the first step, we eharaeterize the uplink eapaeity, in order to evaluate the 
available feedbaek rate. In an FDD seenario, the uplink gains the largest improvement, as the 
BS ean direetly estimate the uplink ehannels. Let Hy denote the uplink N x K ehannel matrix, 
the uplink reeeived signal is given by: 


yu = VpHuXu + nu. 


( 9 ) 


The aehievable user rates in the multiple aeeess ehannel were derived in [25] for various deteetors. 
For simplieity, we present here only the analysis of the linear MMSE reeeiver (whieh is obviously 
a lower bound on the uplink eapaeity). This MMSE bound is given by: 



( 10 ) 


where hu,i and Hyj denote the i-th eolumn of Hu and the remaining matrix with all other 
eolumns of Hu, respeetively. 

As N goes to infinity, and if limTv^oo K/N < 1, then [26]: 


lim 

N^OO 





( 11 ) 


N 


N^cio N 


and hence: 



( 12 ) 
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Thus the uplink can support any number of mobiles which grows at most linearly with N with a 
rate that grows logarithmically for each mobile. (In the following we choose the number of users, 
K, to grow only logarithmically with N, and hence the limit in (11) satisfies lim^r^oo % = 0.) 

2) Feedback quantization: As mentioned above, we assume that each mobile has perfect 
knowledge of the channels. Each mobile quantizes its measured channels and sends the index 
of the quantized version to the BS. We write the channel matrix as the sum: 

H = H + e (13) 


where H is the quantized channel known to the transmitter, while e is the channel error that is not 
known to the transmitter. For simplicity we assume an optimal, multi-dimensional, quantization 
scheme'*. Using rate distortion theory, the rate that is required to quantize the channel gains of 
the Nt antennas to a quantization mean square error (MSE) of E'[|eijp] = is [27], [28]: 


F^ = 




(14) 


where we have assumed that the feedback is sent only once for a coherence block of T symbols. 
Furthermore, by the asymptotic properties of the rate distortion function (e.g., [28] chapter 
10.3), each quantization error element, e* j, is a proper complex Gaussian variable with zero 
mean, which is statistically independent of the quantized channel, H, the transmitted symbols, 
X and every other element of e. Thus, we have: 



(15) 


3) Linear precoding: The transmitted vector is generated by the linear precoding: 


x = U^s 


(16) 


where U is precoding matrix and s is the JT x 1 vector that contains the actual data symbols, 
which are assumed to be iid proper complex Gaussian random variables with zero mean and 
unit variance. 

To derive a general result that includes both the BE and the ZF schemes as special cases, we 
consider a scheme that performs partial ZF, i.e., tries to zero the interference to L users, where 


^See [19] for an analysis of a simpler quantization scheme that can also he adapted to the problem discussed here. 
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Q < L < K — 1. All other degrees of freedom are used to inerease the received signals powers. 
Thus, each row of U is generated from the corresponding row in H by first zero-forcing the 
signal to L users and then normalizing its power. In mathematical terms, denoting by h* the i-th 
row of H and by Pj the projection matrix to the null space of the L selected interfered users, 
we define hj = h^Pj, and the Ath row of U is given by: 

hi 


u, = 


(17) 


Note that the ^/K term in the denominator satisfies the power constraint, (2), using i7[ss^] = I. 
The resulting effective channel is: 

y = ypHU^s + n. (18) 

Thus, the achievable rate of user i is lower bounded by: 


Ri > E 


log2 1 + 


P|hiuf 1 

"E[\si\ 

P] 

P 

hjuf 

\^E[\s,\^] + l 


(19) 


4) Rate balancing: In order to study the rate balancing tradeoff, we need to characterize the 
distribution of the numerator and the denominator of (19). For the random term in the numerator 
we use: 


hjuf = hjuf + CjU 

hiPihf 




\/AA/h,P,hf 


+ Ciuf 


h,hf Vh,p,hf 


H 


Vk 


hihf 


,_ TiVi -I- -;=Wi 


( 20 ) 


where rf has a chi-square distribution with 2Nt degrees of freedom, Vi has a beta distribution, 
Vi ~ (3{Nt — L, L), and Wi has a complex normal distribution with zero mean and a variance of 
1. For the denominator we have: 


|h*u 

k^i 


H\2 
k \ 


k^i,k((iCi 


H\2 
k \ 


E 

k^i,kCiCi 


EiUk 


H\2 




( 21 ) 
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where has a chi-square distribution with 2{K — 1 — L) degrees of freedom and gf has a 
chi-square distribution with 2L degrees of freedom, and Ci is the set of users that perform 
interference cancellation for user i. 

FT 

Using (14), we have = 2~~. Substituting together with (20), (21) and -^[Isjp] = 1 
into (19) gives: 


Ri > E 


( 


logs 


P 


1 + 




2\ n 




V 


^(p2 + 2 


J\ 


( 22 ) 


and taking the limit as p grows to infinity gives the interference limited rate: 


lim Ri > E 

p^OO 


( 


logs 


1 + 


\/l — R 


w. 


2 \ 1 




V 


p^,+2-^gf 


/} 


(23) 


If L < J-f — 1 then the limit of (23) is bounded as Ei grows to infinity. On the other hand, in 
the ZF case L = K — 1, and we get pi = 0 and: 


lim lim > lim —E 

Fi^OO p—^QO Fi^OO Fi 


. I H 

log2 1 + - 


2 ivt^2 


. ^ ft ^ 

= hm -— ——h E 

Fi^oo Ei y Nt 

T 

which completes the proof of Theorem 1. 


logs 


TiVi 


(24) 


B. Proof of Theorem 2 

1) Proof for linear precoding: We begin by proving Theorem 2 for the BF and ZF linear 
precoding schemes. We rewrite (19) as: 


Ri > E 


logs 1 + 


„ K |h,ufp 
PNt-LjFT) 


Nt 


K 


(25) 


Nt-L {l-E'f)Nt \P '^k^i ^ ^ 

and set the feedback rate to be Ei = Cf logs Ff. Considering the different terms in (25), we use 
the following limits: 


lim 


K Ih'U^P 

' * * ' = lim 


|hjhf -f ejhf p 


Nt-L^^ Nt-L{l- S2) Nt-L^oo (iVt - L)(l - S2)hihf 


= 1 


(26) 
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lim y IhjuFp = lim 

K^OO 


+ K-L 


k^i 


Nt^oo K 

= i-{i-e)P- 


(27) 


and 


lim 


N, 


(28) 


Nt^oo Nt- L I- PC 

where P = liniTv^oo C = liniiv^-oo and Cp = liniAr^-oo Note that the limit in (28) 
does not converge for /3 = ^ = 1. Thus, we limit the analysis to the case that < 1. In the 
following we will show that the performance significantly degrade sls PC approaches 1, and hence 
the limiting case ^ 1 is not interesting. Assuming that the bound $ = lim^^go 
exists, and substituting together with (26)-(28) into (25) gives 

P 


lim Ri > E 

N^OO 


logs 1 + 


^«I>-(p(l-/3(l-a) + l) 


(29) 


We next consider the two cases of the theorem: If Nt grows faster than a logarithmic function 
of N, we have C = 0, 


and 


CfT logo N 

c‘^ = lim 2 = 1 

N^oo 


a. r ^ 

$ = hm — • 

N—>-oo Nt 


(30) 


PjT 

2 


= lim a ■ 

a^O 


1 - 2 ' 




Cu log2 e 

CfT 


(31) 


where we substituted a = K/Nf Thus, (29) simplifies into (4). On the other hand, if Nt = 


CfT 


Ct log 2 N, we get C = cjct, C'^ = 2 -t , $ = 

( 


Cu/Ct 

-SiZ 

1-2 


and 


lim Ri > E 

Af —>00 


logs 


\ 


1 + 


p 


p(l_/3(i_s ))+l 


V 


1-^Cu/ct 


_£fz; 

1-2 


(32) 


which is easily simplified to (5) with A = 0. 
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The reader may note that the proof is more general than the theorem, as it allows the use of 
partial ZF for any number of users between 0 and K — 1, while the theorem only addresses the 
two extreme eases (/3 = 0 or /3 = 1). Interestingly, it is easy to show that partial ZF eannot lead 
to any advantage in the analyzed seenario, and will always be inferior to either the BF or the 
ZF seheme. To verify this point, one ean test the derivative of (29) with respeet to (3. The sign 
of this derivative satisfies: 

‘'lim"^°°~^. }-i^gn{p(l-a-C(l+p)} (33) 

whieh does not depend on {3. Thus, the optimal value of (3 ean only be one of its extreme values: 
0 (eorresponding to BF) or 1 (eorresponding to ZF). 

2) Proof for DP: To eomplete the proof of Theorem 2, we next present and analyze the DP 
preeoding seheme. More speeifieally, we eonsider the dirty paper seheme with zero foreing linear 
pre-proeessing (e.g., [29]). We use the LQ deeomposition of the quantized ehannel matrix: 

H = LQ (34) 



where L is a lower triangular matrix and Q is a unitary matrix, and apply a preeoding of the 
data symbols: 


x= 


s 

0 


(35) 


where x is the veetor of transmitted symbols and s is the K xl vector that contains the actual 
data symbols. Note that the precoding does not change the power of the resulting transmitted 
vector. Thus, using £'[ss^] = gives E[x^x] = £'[s^s] = 1. The resulting effective channel 
is: 


y = v^Ls + ^ex + n. (36) 

Recalling that L and s are known to the BSs while e is not known, we define: 

2—1 

Zi = '^ iijSj (37) 

Nt 

uJi = rii + ^/p'^ eijXj (38) 

i=i 
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and we can write: 


Vi = y/p^i,iSi + Zi + Wi. 


(39) 


where ii^i is the i-th element on the diagonal of the matrix L. The zero mean of and its 
statistical independence on H guarantee that E[siWi\ = E[ziWi\ = 0. Hence, we use the following 
lemma to lower bound the rate. 

Lemma 1: In the channel model of (39), if E[siWi\ = 0 then the channel capacity is lower 
bounded by: 



(40) 


Proof: Lemma 1 can be directly proved using Theorem 1 of [30], which guarantees the 
applicability of the dirty paper scheme even if the noise, Wi is not Gaussian, and possibly even 
statistically dependent on Sj and Zi. An alternative proof of this theorem, [31], also showed that 


bound can be achieved using a multidimensional lattice precoding. 
Thus we have: 



(41) 


where we usedi?[|sjp] = 1/JL. Using Lemma 2.1 of [32] (originally from [33]) 2|fj_jp/(l — S^) 
has a Chi square distribution with 2 (At — i + 1) degrees of freedom. 

To homogenize the system we use random permutations between the users. Thus, we get 
identical rates for all users. Denoting the rates before the randomization as Rf and averaging 
the achievable rate over all users, gives: 



k=l 




(42) 
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K c 2 _ ^ A 


t^oo jVj j s ^i “iiiu M/ iiiii^^QQ (1—S^)Aft 


where we used again ^ = limAr,^.oo s = 1 ™ 
substituted the bound of the diagonal elements of the matrix L for 0 < a < 1: 

1 ^; ■'' 


lim 


^kM 


N^-aK+l 

= lim - tt-= 1 — a(^. 


Nt-K^oo NJl -Ej) Nt^oo N, 

k=laK\ 


, and also 


(43) 


In the first ease of the theorem, Nt grows faster than a logarithmie funetion of N. From 
Sub-subseetion IV-Bl we have C = 0, = 1 and <h = In this ease, the integral trivially 

simplifies to (4). 


CfT 


For the seeond ease, we have Nt = ctlog 2 iV, whieh leads to C = Cu/ct, = 2 and 


^ • Thus, (42) becomes: 


1-2 


_CfT 


Ri> logs 1 + 


l- 2 "^p(l-a^) 


cfT 


Substituting m = (1 — a^) and 7 = p ^ ^ 


Cu/ct i + p2 


da. 


(44) 


1 

Ri> ^ J log 2 

^_ Cu 

ct 


Ct -f 7'«log2(e) + (1 + f 7 m) log 2 (l + f 7“) 
Cu 


Cu ' 


^_ Cu 

Ct 


(l + f;7) log2(l + f^7) 


7 


- log 2 (e) 


(1 + f:7(i -1)) iQg2(i + g:7(i -1)) 
7 


(45) 


Adding the term (1 + -^ 7 ) log 2 (l + — 7(1 — —)) in the last line of (45) and subtracting it from 

Cu Cu Ct 

the previous line simplifies the expression to: 


(1 + f^7) log 2 


Ri > 




7 

+ log2(l + —7(1 - —)) 

Cu Ct 


log 2 (e) 


(46) 


which simplifies to (5) and ( 6 ) with P = 1. 
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Fig. 1. Average spectral efficiency per user in the downlink of the massive MIMO system, as a function of the number of hits 
in feedback block. The system uses Nt — 8 transmit antennas and serves K = 8 users. 


V. Numerical examples 

Obviously, the linear increase of the rate with the feedback can only happen in an interference 
limited system. Thus, the figure shows that for SNR of p = 30dB the spectral efficiency flattens 
out around 6 and 8 Bps/Hz in the ZF and DP schemes, respectively. Yet, such a spectral efficiency 
is much better than what we have in current state of the art systems with the same SNR. 

To demonstrate the scaling of the rate with the number of antennas. Fig. 2 depicts the average 
spectral efficiency per user at the downlink as a function of the number of transmit antennas. 
For simplicity, this figure depicts only the performance of the BF system for the case = 4, 
CfT = 10, i.e., the number of users is set to the integer that is closest to 4 ■ log 2 (iV) and the 
number of feedback bits takes ^ of the uplink rate of each user (e.g., if the coherence block 
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Fig. 2. Average spectral efficiency per user in the downlink of the massive MIMO system, as a function of the number of 
receive antennas (Cu = 4, CfT = 10) THIS SHOULD APPEAR IN TEXT ALSO. The figure depicts the performance of a 
beamforming (BE) system, for various scalings of the number of antennas used for transmission. 


contains 180 symbols as in the example of [19], then approximately 5.6% of the uplink is devoted 
to feedbaek). 

The figure shows three different eurves for different ehoiees of the number of transmit 
antennas. The 2 lower curves correspond to logarithmic scaling of the number of transmit 
antennas with the number of reeeive antennas, while the highest eurve uses all of the available 
antennas for transmission. In all eases, the simulated speetral effieieney (per user) eonverges to 
the asymptotie results of Theorem 1 (when the number of antennas is large enough). Thus, the 
system sum rate seales logarithmieally with the number of antennas. This logarithmie sealing 
is much slower than the linear sealing of the uplink. Yet, it ean still give signifieant throughput 
gains. As an example, the figure give more details for the case of 4 and 128 antennas (note 
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Fig. 3. Total spectral efficiency in the downlink of the massive MIMO system, as a function of the number of receive antennas. 
The number of served user is 7T = Cu logj N with Cu = 4. 


that the number of transmit antennas is upper bounded by the total number of antennas). A 
eomparison of these two points shows that increasing the number of antennas from 4 to 128 
results in a multiplication of the sum-rate by more than 5 for all three setups. 

To further demonstrate the massive MIMO gain, Fig. 3 depicts the sum of all user rates 
as a function of the number of receive antennas for the three analyzed schemes. As can be 
seen, in all cases we have a logarithmic growth of the system throughput with the number of 
antennas. Furthermore, as predicted by (4), when all antennas are used for transmission (At = A) 
all schemes converge to the same performance. As stated above, it is more practical to limit 
the number of transmit antennas. Hence, it is good to observe that logarithmic scaling is also 
obtained using a logarithmic number of antennas. Note that all antennas are required to be used 
for reception. This is in order to maintain the logarithmic growth of the uplink rate with the 
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number of antennas, which in turn is needed to support the required feedback rate. 

VI. CONCLUSIONS 

This work studied the extension of the uplink-downlink balancing principle to FDD massive 
MIMO systems. The uplink-downlink balancing concept is based on using part of the uplink 
capacity for CSI feedback in order to increase the downlink capacity. We extended the proof 
of the balancing principle, to the case of a massive MIMO system that performs linear zero¬ 
forcing, showing that any unused uplink rate can be converted to an increase in the downlink 
rate with a favorable exchange ratio. The possibility to tradeoff uplink rate for downlink rate 
is very important in FDD massive MIMO systems, as the use of many receive antennas yields 
significant gains for the uplink throughput. 

We further study the case in which a constant fraction of the uplink rate is devoted to CSI 
feedback, and show that the downlink sum-rate can scale logarithmically with the number of 
antennas. This scaling is achieved by scaling the number users logarithmically with the number 
of transmit antennas and achieving a constant throughput for each user. While this scaling is 
lower than the uplink rate scaling or the TDD rate scaling, it still brings good throughput gains. 

The practicality of the suggested approach is further emphasized by showing that the logarith¬ 
mic scaling is achievable even when only using a small fraction of the antennas for transmission. 
More specifically, the number of transmit antennas is required to only scale logarithmically with 
the number of receive antennas. This limits the number of downlink pilot symbols needed, which 
significantly improves the system efficiency. 
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